POLYPEPTIDES MIMICKING THE ACTIVITY OF HUMAN ERYTHROPOIETIN

The present invention relates to polypeptides having
a sequence obtained by the informational spectrum method
and the ability to stimulate the production of
reticulocytes and red blood cells from bone marrow cells,
as well as hemoglobin synthesis and iron uptake.

Human erythropoietin (EPO), a hormone playing a
fundamental role in erythropoiesis, is a glycoprotein
having a molecular weight of about 34-38 kD, the primary
structure of which is shown in Figure 1.
EPO is presently obtained by recombinant DNA
techniques using eukaryotic cells which can glycosylate
the expression product.

The availability of shorter polypeptides, possibly
active even in unglycosylated form, mimicking the
biological activity of human EPO would be a highly
desirable goal, allowing convenient preparation by
synthetic procedures.

It has now been found that polypeptides having, in
the informational spectrum obtained by Fourier
transformation according to the informational analysis
method, substantially the same frequencies as natural
erythropoietin have substantially the same activity as
human erythropoietin.

The informational analysis method (ISM), first
disclosed by Veljkovic V. et al. in IEEE Trans. Biomed.
Eng. 32, 337 (1985); Cancer Biochem. Biophys. 9, 139
(1987); Phys. Rev. Lett. 29, 105 (1972) and Phys. Lett.
45A, 41 (1973), the content of which is herein
incorporated by reference, is based on the analysis of
the information encoded in the primary structure, which is
expressed by molecular electric oscillations propagating
through a polar environment. Based on the previously
demonstrated strong correlation between electron-ion
interaction potential (hereinafter EIIP) [Veljkovic V.,
A theoretical approach to preselection of carcinogens
and chemical carcinogenesis, Gordon & Breach Sci. Pub.,
New York (1980); Politzer P. and Truhlar D. G., Chemical
applications of atomic and molecular electrostatic
potential, Plenum Press, New York (1981); Politzer P.,
Toxicol. Lett. 43, 227 (1988)], it has been proposed that
the information expressed by electric oscillations is
encoded in the protein primary structure by the
distribution of the EIIP values of the amino acids.

According to this approach, protein sequences
are transformed into signals by assigning a numerical
value to each amino acid. These values correspond to the
EIIP [Veljkovic V. and Slavic I., Phys. Rev. Lett., 29,
105 (1972); Veljkovic V., Phys. Lett., 45A, 41 (1973)].
The signal obtained is then decomposed into periodic
functions by Fourier transformation. The result is a
series of frequencies and their amplitudes. The obtained
frequencies correspond to the distribution of structural
motifs with defined physico-chemical characteristics
responsible for the biological function of the protein. When
comparing proteins which share the same biological or
biochemical function, the technique allows detection of
code/frequency pairs which are specific for their common
biological properties. The method is insensitive to the
location of the motifs and thus does not require
previous alignment of the sequences. The ISM has been
successfully applied in structure/function analysis and
de novo design of peptides [Cosic I. and Nesis D.,
Eur. J. Biochem., 170, 247 (1988); Skerl V. and Pavlovic
M., FEBS Lett., 239, 1411 (1988); Veljkovic V. and
Metlas R., Cancer Biochem. Biophys., 10, 191 (1988);
Cosic I. et al., Biochemie, 71, 333 (1989); Lalovic D.
and Veljkovic V., Biosystems, 23, 311 (1989); Cosic I.,
Resonant recognition model of protein-protein and
protein-DNA recognition, in Bioinstrumentation and
Biosensor (edited by Weis D. L.), Marcel Dekker, Inc.,
New York (1990); Cosic I. et al., Eur. J. Biochem., 198,
113 (1991); Cosic I. and Hearn M. T. W., J. Mol.
Recognition, 4, 57 (1991); Veljkovic V. et al.,
Biochem. Biophys. Res. Commun., 189, 705 (1992);
Krsmanovic B. et al., WO 93/17108].

An object of the invention is provided by EPO
muteins having a degree of homology with natural
erythropoietin lower than 60%, or polypeptides having
from 20 to 100, preferably from 15 to 70, amino acids,
characterized by an informational spectrum having
substantially the same frequencies as found in the
informational spectrum of natural erythropoietin.

The muteins or polypeptides according to the
invention may be designed so as to include appropriate
O- or N-glycosylation sites, even though it has been
surprisingly found that glycosylation is not always
necessary for the biological activity.

More particularly, the muteins or polypeptides of
the invention are characterized by the frequency
component 0.312 ± 0.004 in the informational spectrum
and at least one of the following frequency components:
0.023, 0.156, 0.180, 0.195, 0.258, 0.273, 0.285, 0.363
and 0.500, determined with an accuracy of ± 0.004.

Preferred muteins of the invention are shown in
Figures 2a and 2b (Sequence Id n. 1 and 2), whereas
preferred polypeptides are shown in Figure 3 (Sequence
Id n. 3, 4, 5 and 6). In Figures 1, 2a and 2b, the
predicted amphipathic α-helices are underlined with a
double line (====). Residues predicted to be on the
surface are underlined. Each of the three N-linked and
the one O-linked glycosylation sites is designated by an
asterisk. The mutations are designated by small letters.

Another group of preferred peptides according to
the invention, having 16-40 amino acids, is
represented by the following general formula:
X1-Tyr-X2-Cys-X3-X4-Gly-Pro-X5-Thr-Trp-X6-Cys-X7-Pro-X8,
where X1 = Thr, Gln, His or Asp, X2 = Ser, Asn, Gly, Pro or Ala,
X3 = Thr, Gln, Asp, Ser, Asn, Gly, Pro, Ala, Ile, Val,
Phe, Lys, Tyr, Met or Glu, X4 = Phe, Val or Met, X5 =
Leu, Ile, Met or Val, X6 = Leu, Ile or Val, X7 = Lys,
Arg or Asp, and X8 = Gln, Thr, His, Ser, Ala, Val or
Leu. Optionally the peptide may be cyclised or
dimerised.

These peptides are able to bind to the
erythropoietin receptor and show in the IS the frequency
component 0.312 with a corresponding amplitude A >
0.11.
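As an illustration only, the general formula above can be written as a pattern over one-letter amino acid codes and used to screen candidate peptides. The sketch below assumes that a peptide qualifies when the 16-residue core occurs anywhere within a chain of 16-40 residues; the function name and the two test peptides are hypothetical and do not come from the specification. It checks the sequence pattern only, not receptor binding or the spectral condition.

```python
import re

# One-letter rendering of X1-Tyr-X2-Cys-X3-X4-Gly-Pro-X5-Thr-Trp-X6-Cys-X7-Pro-X8.
MOTIF = re.compile(
    "[TQHD]"             # X1 = Thr, Gln, His or Asp
    "Y"                  # Tyr
    "[SNGPA]"            # X2 = Ser, Asn, Gly, Pro or Ala
    "C"                  # Cys
    "[TQDSNGPAIVFKYME]"  # X3
    "[FVM]"              # X4 = Phe, Val or Met
    "GP"                 # Gly-Pro
    "[LIMV]"             # X5 = Leu, Ile, Met or Val
    "TW"                 # Thr-Trp
    "[LIV]"              # X6 = Leu, Ile or Val
    "C"                  # Cys
    "[KRD]"              # X7 = Lys, Arg or Asp
    "P"                  # Pro
    "[QTHSAVL]"          # X8
)

def contains_motif(peptide: str) -> bool:
    """True if a 16-40 residue peptide contains the 16-residue core (assumed rule)."""
    return 16 <= len(peptide) <= 40 and MOTIF.search(peptide) is not None

# Hypothetical peptides, invented for this example.
print(contains_motif("GGTYSCTFGPLTWVCKPQGG"))  # True
print(contains_motif("GGTYSCTFGALTWVCKPQGG"))  # False: Gly-Pro replaced by Gly-Ala
```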

The invention also relates to polynucleotide
sequences coding for said muteins or polypeptides, to
expression vectors comprising said polynucleotide
sequences and to hosts transformed or transfected by
said vectors, as well as to nucleotide sequences which
hybridize to the above-mentioned coding sequences.

The polypeptide sequences of the invention are
determined by a procedure involving the following steps:
1. determination of the consensus characteristic
   frequencies for the EPO molecules;
2. derivation of a new numerical sequence of the
   desired length having the same characteristic
   frequencies using inverse Fourier transformation;
3. determination of the amino acid corresponding to
   each element of this new numerical sequence from
   the values of EIIP (see Table 1).

Table 1

Amino acid    EIIP [Ry]*
L             0.0000
I             0.0000
N             0.0036
G             0.0050
V             0.0057
E             0.0058
P             0.0198
H             0.0242
K             0.0371
A             0.0373
Y             0.0516
W             0.0548
Q             0.0761
M             0.0823
S             0.0829
C             0.0829
T             0.0941
F             0.0946
R             0.0959
D             0.1263

* Ry = Rydberg unit
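A minimal sketch of how Table 1 might be used in steps 2 and 3 follows. The EIIP values are copied from Table 1; everything else is an assumption made for illustration: the function names, the use of a plain sum of cosines as the inverse Fourier synthesis, the rescaling of that signal onto the EIIP range, and the nearest-value rule for choosing a residue. Note that the mapping back to residues is not unique, because L and I, and S and C, share the same EIIP value.

```python
import math

# EIIP values in Rydberg, copied from Table 1 (one-letter amino acid codes).
EIIP = {
    "L": 0.0000, "I": 0.0000, "N": 0.0036, "G": 0.0050, "V": 0.0057,
    "E": 0.0058, "P": 0.0198, "H": 0.0242, "K": 0.0371, "A": 0.0373,
    "Y": 0.0516, "W": 0.0548, "Q": 0.0761, "M": 0.0823, "S": 0.0829,
    "C": 0.0829, "T": 0.0941, "F": 0.0946, "R": 0.0959, "D": 0.1263,
}

def encode(sequence):
    """Protein sequence -> numerical sequence of EIIP values."""
    return [EIIP[aa] for aa in sequence]

def closest_residues(value):
    """All amino acids whose EIIP lies nearest to `value` (step 3, assumed rule)."""
    best = min(abs(v - value) for v in EIIP.values())
    return sorted(aa for aa, v in EIIP.items() if abs(v - value) == best)

def synthesize(length, freqs, lo=0.0, hi=0.1263):
    """Step 2 (assumed form): sum of unit cosines at `freqs`, rescaled to [lo, hi]."""
    raw = [sum(math.cos(2 * math.pi * f * m) for f in freqs) for m in range(length)]
    rmin, rmax = min(raw), max(raw)  # assumes the signal is not constant
    return [lo + (r - rmin) * (hi - lo) / (rmax - rmin) for r in raw]

def design(length, freqs):
    """Map each synthesized value to one nearest residue (ties broken alphabetically)."""
    return "".join(closest_residues(v)[0] for v in synthesize(length, freqs))

print(encode("GPLTW"))          # [0.005, 0.0198, 0.0, 0.0941, 0.0548]
print(closest_residues(0.083))  # ['C', 'S']
print(design(16, [0.312]))      # a 16-residue toy sequence, not a claimed peptide
```

The toy output of `design` only illustrates the mechanics; it says nothing about biological activity.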

The polypeptides may be obtained by conventional
methods of peptide synthesis or by known recombinant DNA
techniques.

The polypeptides or muteins of the invention may be
administered to humans or animals in the form of suitable
pharmaceutical compositions, usually but not exclusively
to be administered parenterally. Said compositions will
contain from 1 to about 100 mg of mutein or polypeptide
for the treatment of the same pathological conditions
presently treated with human or recombinant EPO.

WO 97/49729                                  PCT/EP97/03228

The following examples further illustrate the
invention.

Example 1

Analysis of amino acid sequence by the ISM

In the first step of the ISM analysis each
constitutive element (amino acid) in the analyzed sequence
is represented by the corresponding EIIP value. For the
calculation of EIIP the following expression, derived
from the "general model pseudopotential" [Veljkovic V.
and Slavic I., Phys. Rev. Lett., 29, 105 (1972);
Veljkovic V., Phys. Lett., 45A, 41 (1973)], was used:

$$W = \frac{0.25\, Z^{*} \sin(1.04\, \pi Z^{*})}{2\pi} \qquad (1)$$

where Z* is the average quasi-valence number determined
by:

$$Z^{*} = \frac{1}{N} \sum_{i=1}^{m} n_i Z_i \qquad (2)$$

where Z_i is the number of valence electrons of the i-th
atomic component, n_i is the number of atoms of the i-th
component, m is the number of atomic components and N is the
total number of atoms in the side group.
The values of EIIP for the side groups of amino acids,
calculated in accordance with Eq. (1), are given in Table 1.
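The two expressions can be checked numerically on the simplest side groups. The sketch below assumes the side group of glycine is a single H atom and that of alanine is CH3, with 1 valence electron for H and 4 for C; the function name and input layout are illustrative. For both groups Eq. (1) gives a negative number whose magnitude agrees with the Table 1 entry, which suggests (an inference, not stated in the text) that Table 1 lists absolute values.

```python
import math

def eiip(atoms):
    """EIIP in Ry from Eq. (1)-(2); `atoms` is a list of (Z_i, n_i) pairs."""
    n_total = sum(n for _, n in atoms)               # N, total number of atoms
    z_star = sum(z * n for z, n in atoms) / n_total  # Eq. (2)
    return 0.25 * z_star * math.sin(1.04 * math.pi * z_star) / (2 * math.pi)  # Eq. (1)

gly = eiip([(1, 1)])          # side group H
ala = eiip([(4, 1), (1, 3)])  # side group CH3
print(round(abs(gly), 4), round(abs(ala), 4))  # 0.005 0.0373
```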

The numerical series determined in this way is a
finite-length deterministic discrete signal containing
information corresponding to selective long-distance
interaction among biological macromolecules. In order to
analyze this information, the obtained numerical
sequence was subjected to discrete Fourier
transformation (DFT), which is defined as follows
[Rabiner L. R. and Gold B., Theory and Application of
Digital Signal Processing, Prentice-Hall Inc., Englewood
Cliffs (1975)]:

$$X(n) = \sum_{m=0}^{N-1} x(m)\, e^{-j 2\pi n m / N}, \qquad n = 1, 2, \ldots, N/2 \qquad (3)$$

Here x(m) is the m-th member of a given numerical series,
N is the total number of points in the series and
X(n) are the coefficients of the DFT. The coefficients
describe the amplitude, phase and frequency of the
sinusoids of which the original signal consists. The
absolute value of the complex DFT coefficients determines
the amplitude spectrum, which in the ISM is defined as the
informational spectrum (IS) and is represented by the
following equation:

$$S(n) = X(n) X^{*}(n) = |X(n)|^{2}, \qquad n = 1, 2, \ldots, N/2 \qquad (4)$$

It was assumed that the points in the analyzed numerical
sequences are equidistant with the distance d = 1. In
this case the maximal frequency in the spectrum is
F_max = 1/2d = 0.5. It is important to note that the
frequency range is independent of the number of points in
the sequence. The total number of points in the sequence
(i.e. the number of amino acids in the analyzed primary
structure) influences only the resolution of the IS. In the
case of an N-point sequence, the resolution equals 1/N.
The minimal length of sequence that can be analyzed by the
ISM is determined by the desired resolution of the
spectrum. Therefore, this number is determined by the
expected number of peaks which are to be strictly
separated and cannot be exactly defined. The minimal
length of sequence which can be analyzed by the ISM with
suitable accuracy is 16 amino acids.
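The definitions above translate directly into a few lines of code. The following sketch evaluates Eq. (3) and Eq. (4) by a direct sum (no FFT library is assumed) and returns each S(n) together with its frequency n/N. The input is a synthetic 16-point cosine chosen for illustration, not a protein signal; with N = 16 the resolution is 1/16 = 0.0625 and the highest frequency is 0.5, as stated in the text.

```python
import cmath
import math

def informational_spectrum(signal):
    """Return [(frequency, S(n))] for n = 1 .. N/2, following Eq. (3) and Eq. (4)."""
    n_points = len(signal)
    spectrum = []
    for n in range(1, n_points // 2 + 1):
        # Eq. (3): DFT coefficient X(n)
        x_n = sum(x * cmath.exp(-2j * cmath.pi * n * m / n_points)
                  for m, x in enumerate(signal))
        # Eq. (4): S(n) = |X(n)|^2, reported at frequency n/N (d = 1)
        spectrum.append((n / n_points, abs(x_n) ** 2))
    return spectrum

# Synthetic test signal: a cosine with 5 cycles over 16 points (frequency 5/16).
signal = [math.cos(2 * math.pi * 5 * m / 16) for m in range(16)]
spec = informational_spectrum(signal)
peak_freq = max(spec, key=lambda fs: fs[1])[0]
print(len(spec), spec[0][0], spec[-1][0], peak_freq)  # 8 0.0625 0.5 0.3125
```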



In this way, the information primarily defined by
the sequence of symbols representing amino acids is
presented in spectral form, which is more suitable for
mathematical analysis. It is important to note that the ISM
does not influence this information and represents only
a tool for its analysis (like a prism which decomposes
white light into its spectral components). Each
frequency in the IS represents a particular
informational component encoded in the primary structure
by regularly distributed structural motifs with similar
electronic properties.

Example 2

Application of the technique in Example 1 to the
analysis of EPO proteins

The algorithmic procedures were applied to the
mammalian EPO protein sequences.

The analysis procedure comprises the following
steps:

1. each amino acid sequence was converted to a
   numerical sequence by representing each amino acid
   with the corresponding value of the EIIP;

2. this numerical sequence was converted into a
   numerical spectrum using the fast Fourier transform
   (hereinafter FFT);

3. the spectra were mutually compared using cross-spectral
   analysis with the aim of extracting common frequency
   components.
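The three steps can be sketched as follows. The text does not spell out how the cross-spectrum or the signal-to-noise ratio is computed, so this example makes explicit assumptions: shorter sequences are zero-padded to the length of the longest one, the cross-spectrum is the pointwise product of the amplitude spectra, and S/N is each product divided by the mean over all frequencies. A direct DFT sum stands in for the FFT (both give the same coefficients). The two input sequences are artificial alternating patterns, not EPO sequences.

```python
import cmath

# EIIP values in Rydberg, copied from Table 1.
EIIP = {
    "L": 0.0000, "I": 0.0000, "N": 0.0036, "G": 0.0050, "V": 0.0057,
    "E": 0.0058, "P": 0.0198, "H": 0.0242, "K": 0.0371, "A": 0.0373,
    "Y": 0.0516, "W": 0.0548, "Q": 0.0761, "M": 0.0823, "S": 0.0829,
    "C": 0.0829, "T": 0.0941, "F": 0.0946, "R": 0.0959, "D": 0.1263,
}

def amplitude_spectrum(seq, n_points):
    """Steps 1-2: |X(n)|, n = 1 .. n_points/2, of the zero-padded EIIP signal."""
    signal = [EIIP[aa] for aa in seq] + [0.0] * (n_points - len(seq))
    return [abs(sum(x * cmath.exp(-2j * cmath.pi * n * m / n_points)
                    for m, x in enumerate(signal)))
            for n in range(1, n_points // 2 + 1)]

def cross_spectrum(seqs):
    """Step 3 (assumed definition): product of spectra, returned as (frequency, S/N)."""
    n_points = max(len(s) for s in seqs)
    product = [1.0] * (n_points // 2)
    for seq in seqs:
        for i, amplitude in enumerate(amplitude_spectrum(seq, n_points)):
            product[i] *= amplitude
    mean = sum(product) / len(product)
    return [((i + 1) / n_points, p / mean) for i, p in enumerate(product)]

# Artificial sequences sharing a period-2 pattern, so the common peak is at 0.5.
cs = cross_spectrum(["DLDLDLDLDLDLDLDL", "RIRIRIRIRIRIRIR"])
peak = max(cs, key=lambda fs: fs[1])
print(peak[0])  # 0.5
```

A frequency shared by all inputs survives the product, while a peak present in only one spectrum is suppressed by the near-zero values at that frequency in the others.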

From the analysis of the cross-spectra of EPO molecules
from various mammalian species (mouse, rat, rabbit,
sheep, monkey and human) one can deduce a set of
characteristic frequencies which predominate in the
obtained CIS. In Table 2 all characteristic frequencies
(S/N > 1) are given:

Table 2

F1     0.023
F2     0.156
F3     0.180
F4     0.195
F5     0.258
F6     0.273
F7     0.285
F8     0.312
F9     0.363
F10    0.500

All frequency components in Table 2 are determined
with an accuracy of ± 0.004.

From the obtained results, it is possible to
conclude that the information which is essential for the
biological activity of the analyzed EPO molecules is
completely determined by the set of characteristic
frequencies given in Table 2.

Example 3

Application of the technique in Example 1 to the
analysis of EPO-EPOR interaction

In order to determine which of the characteristic
frequency components from Table 2 determines the
information that is essential for the interaction of human
EPO with the erythropoietin receptor (EPOR), the
cross-spectrum between these two proteins was obtained.

The characteristic frequencies corresponding to the
first 15 amplitudes in the EPO-EPOR cross-spectrum are given
in Table 3.

Table 3

F        S/N
0.311    10.6
0.258    7.5
0.272    7.3
0.113    7.0
0.361    7.0
0.498    5.2
0.430    5.1
0.158    5.1
0.031    4.6
0.382    4.6
0.154    4.4
0.283    4.3
0.275    3.9
0.347    3.9
0.008    3.5


The results presented in Table 3 show that the main
part of the information corresponding to the human EPO-EPOR
interaction is determined by the frequency component
0.311. It is also important to note that, taking into
account the accuracy of ± 0.004 in the determination of
frequency values, 8 of 10 characteristic frequencies
from the EPO CIS (Table 2) are also contained within the
first 15 characteristic frequencies in the human
EPO-EPOR cross-spectrum (Table 3).
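The comparison between Table 2 and Table 3 can be reproduced with a short tolerance check. The frequencies below are the values as printed in the two tables, held as integer thousandths to avoid floating-point edge cases; the function name is illustrative. How the ± 0.004 accuracy was applied is not stated in the text, so two readings are shown: a difference of at most 0.004, and a difference of at most 0.008 (each of the two values allowed its own ± 0.004). With the printed values the first reading gives 7 matches and the second gives the 8 quoted above, the extra match being 0.023 against 0.031.

```python
# Frequencies in thousandths, as printed in Table 2 and Table 3.
TABLE2 = [23, 156, 180, 195, 258, 273, 285, 312, 363, 500]
TABLE3 = [311, 258, 272, 113, 361, 498, 430, 158, 31, 382, 154, 283, 275, 347, 8]

def matched(reference, observed, tol):
    """Reference frequencies that have an observed frequency within `tol`."""
    return [f for f in reference if any(abs(f - g) <= tol for g in observed)]

print(len(matched(TABLE2, TABLE3, 4)))  # 7
print(len(matched(TABLE2, TABLE3, 8)))  # 8
```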

Example 4

Application of the technique in Example 1 to the design of
EPO muteins

Once the characteristic frequencies for the EPO protein
family had been determined, it was possible to design
EPO muteins by introducing a large number of amino acid
substitutions in human EPO. The main condition that must
be satisfied in the design of these muteins is the
conservation of the IS of human EPO.
In Figure 2a, the primary structure of mutein-1,
generated by substitution of 99 (56.6%) amino acids in
human EPO, is given (Sequence Id n. 1). The IS of this
mutein is given in Table 4 and Figure 4a. As can be
seen, this IS contains all 10 characteristic EPO
frequencies from Table 2, as well as other frequency
components corresponding to the first 15 amplitudes in the
IS of human EPO.

Table 4

IS (EPO)    IS (mut. 1)    IS (mut. 2)
0.359       0.359          0.359
0.500       0.156          0.500
0.156       0.500          0.156
0.195       0.285          0.285
0.258       0.258          0.258
0.285       0.195          0.195
0.203       0.203          0.273
0.273       0.273          0.203
0.180       0.402          0.180
0.004       0.180          0.301
0.445       0.004          0.004
0.113       0.113          0.211
0.023       0.172          0.113
0.312       0.211          0.312
0.050       0.312          0.402


In order to design a mutein that will express EPO
activity with higher probability, the structural
characteristics that are important for this biological
activity must also be taken into account [Boissel J. P. et al.,
J. Biol. Chem., 268, 15983 (1993); Wen D. et al., J.
Biol. Chem., 269, 22839 (1994)]. In Figure 2b (Sequence
Id n. 2), the primary structure of mutein-2, generated by
substitution of 72 (43.4%) amino acids in human EPO, is
given. Besides the conserved IS of human EPO
(Table 4 and Figure 4b), this mutein also preserves all
structural elements that are important for the
biological activity of human EPO, including all
glycosylation sites [Wen D. et al., J. Biol. Chem., 269,
22839 (1994)]. The comparison of the alpha and beta
propensity, hydrophilicity, hydrophobicity, solvent
accessibility, antigenicity and secondary structure of
human EPO and mutein-2 is given in Figures 5-6.
SEQUENCE LISTING

(1) GENERAL INFORMATION:

  (i) APPLICANT:
    (A) NAME: DIAPHARM LIMITED
    (B) STREET: Quay House, South Esplanade
    (C) CITY: St. Peter Port
    (D) STATE: Guernsey
    (E) COUNTRY: Channel Islands
    (F) POSTAL CODE (ZIP): GY1 4EJ

  (i) APPLICANT:
    (A) NAME: MARKOVIC Dejan
    (B) STREET: Via Andrea Verga, 5
    (C) CITY: Milano
    (D) STATE: Italy
    (E) COUNTRY: IT
    (F) POSTAL CODE (ZIP): I-20144

  (ii) TITLE OF INVENTION: POLYPEPTIDES MIMICKING THE
       ACTIVITY OF HUMAN ERYTHROPOIETIN

  (iii) NUMBER OF SEQUENCES: 6

  (iv) COMPUTER READABLE FORM:
    (A) MEDIUM TYPE: Floppy disk
    (B) COMPUTER: IBM PC compatible
    (C) OPERATING SYSTEM: PC-DOS/MS-DOS
    (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)




(2) INFORMATION FOR SEQ ID NO: 1:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 166 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

Trp Gly Gly Arg Val Val Cys Asp Thr Arg Ile Val Glu Arg
1               5                   10

Tyr Val Val Glu Trp His Glu Trp Glu Asn Ile Ser Ser Pro
15                  20                  25

Cys Trp Glu Lys Cys Thr Val Asn Glu Asn Val Ser Ile Gly
    30                  35                  40

Asp Ser His Ile Asn Phe Tyr Trp Ala His Arg Met Glu Ile
        45                  50                  55

Pro Gln Gln Trp Leu Glu Ile Ala Gln Pro Val Trp Val Val
            60                  65                  70

Thr Glu Trp Ile Val Arg Pro Gln Trp Val Val Val Asn Ser
                75                  80

Thr Gln Gly Ala Glu Gly Val Gln Val Lys Leu Asp His Trp
85                  90                  95

Ile Thr Pro Val Arg Thr Val Ser Ser Val Val Arg Trp Val
    100                 105                 110

Pro Trp Gln His Glu Trp Val Thr Gly Gly Asp Ala Ala Ser
        115                 120                 125

Ala Ala Gly Val Arg Ser Val Ser Trp Asp Ser Phe Arg His
            130                 135                 140

Val Phe Arg Leu Tyr Thr Asn Phe Val Arg Gly His Val His
                145                 150

Val Tyr Ser Gly Glu Trp Cys Arg Ser Pro Asp Arg
155                 160                 165



(2) INFORMATION FOR SEQ ID NO: 2:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 166 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

Trp Gly Gly Arg Val Val Cys Asp Ser Arg Val Leu Glu Arg
1               5                   10

Tyr Leu Leu Glu Ala His Glu Ala Glu Asn Ile Thr Ser Pro
15                  20                  25

Cys Trp Glu Lys Cys Thr Val Asn Glu Asn Ile Thr Ile Pro
    30                  35                  40

Asp Ser His Ile Asn Phe Tyr Trp Ala His Arg Met Glu Ile
        45                  50                  55

Pro Gln Gln Ala Leu Glu Leu Trp Gln Gly Ala Leu Leu Leu
            60                  65                  70

Thr Glu Ala Leu Val Arg Pro Gln Trp Val Leu Val Asn Ser
                75                  80

Ser Gln Gly Ala Glu Gly Leu Gln Leu His Leu Asp His Ala
85                  90                  95

Leu Thr Pro Leu Arg Thr Leu Ser Ser Val Val Arg Trp Val
    100                 105                 110

Pro Trp Gln His Glu Trp Val Thr Gly Gly Asp Ala Ala Ser
        115                 120                 125

Ala Ala Gly Val Arg Ser Leu Ser Ala Asp Ser Phe Arg Lys
            130                 135                 140

Val Phe Arg Leu Tyr Thr Asn Phe Leu Arg Gly His Val His
                145                 150

Val Tyr Ser Gly Glu Trp Cys Arg Ser Gly Asp Arg
155                 160                 165



(2) INFORMATION FOR SEQ ID NO: 3:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 25 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

Asp Pro Gln Thr Leu Val Asn Ser Ser Tyr Gln Lys His Asp
1               5                   10

Tyr His Asp Pro Leu Asp Ala His Asp His Glu
15                  20                  25

(2) INFORMATION FOR SEQ ID NO: 4:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 35 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Phe His Lys Arg Trp Ala Ala Ser Ala Ala Gln Trp Trp Thr
1               5                   10

Ala His Arg Trp Met Phe Gly Tyr Asp Trp Lys Gln His Trp
15                  20                  25

Asp Glu Asn Ile Asp Gln Ile
    30                  35



(2) INFORMATION FOR SEQ ID NO: 5:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 64 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

Thr Gln Glu Trp Ala Gln Met Ala Ser Ala Phe Ala Trp Arg
1               5                   10

Ser His Arg Gln Ala Glu Asn Ile Asp Ala His Thr Glu Gln
15                  20                  25

Thr Ala His Lys Asp Ser Lys Met Gln Leu Ser Phe Lys Leu
    30                  35                  40

Met Gln Thr Trp His Ala Arg Gln Trp Ala Ala Ser Ala Ala
        45                  50                  55

Glu Trp Asp Gln Met Gln Leu Gln
            60



(2) INFORMATION FOR SEQ ID NO: 6:

  (i) SEQUENCE CHARACTERISTICS:
    (A) LENGTH: 55 amino acids
    (B) TYPE: amino acid
    (C) STRANDEDNESS: single
    (D) TOPOLOGY: linear

  (ii) MOLECULE TYPE: protein

  (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Gln Tyr Tyr Thr Lys Trp Trp Thr Gln Gln Gln Ala Tyr Asp
1               5                   10

Thr Tyr Cys Gln Tyr Gln His Met Thr Val Asn Ser Lys Trp
15                  20                  25

Arg Gln Leu His Asp Arg His Trp Trp Pro Gln Arg Pro Trp
    30                  35                  40

Tyr Trp Gln Ala His Met Cys Trp Tyr Trp Cys Gln Gln
        45                  50                  55
CLAIMS

1. A polypeptide mimicking the activity of human
erythropoietin, having an amino acid sequence which is
different from that of human erythropoietin and presents
substantially the same frequencies as natural
erythropoietin in the informational spectrum obtained by
Fourier transformation according to the method of
informational analysis.

2. A polypeptide according to claim 1, which is an
erythropoietin mutein having a degree of homology with
natural human erythropoietin lower than 60%.

3. A polypeptide according to claim 1 or 2, having
from 15 to 70 amino acids.

4. A polypeptide according to any one of claims 1 to
3, having the frequency component 0.312 ± 0.004 in the
informational spectrum and at least one of the following
frequency components: 0.023, 0.156, 0.180, 0.195, 0.258,
0.273, 0.285, 0.363 and 0.500, the accuracy being ±
0.004.

5. A polypeptide according to any preceding claim,
having any of the primary sequences of Sequence Id n.
1-6.

6. A polypeptide according to claim 1 including the
amino acid sequence:
X1-Tyr-X2-Cys-X3-X4-Gly-Pro-X5-Thr-Trp-X6-Cys-X7-Pro-X8,
where X1 = Thr, Gln, His or Asp,
X2 = Ser, Asn, Gly, Pro or Ala, X3 = Thr, Gln, Asp, Ser,
Asn, Gly, Pro, Ala, Ile, Val, Phe, Lys, Tyr, Met or Glu,
X4 = Phe, Val or Met, X5 = Leu, Ile, Met or Val, X6 =
Leu, Ile or Val, X7 = Lys, Arg or Asp, and X8 = Gln,
Thr, His, Ser, Ala, Val or Leu.

7. A polynucleotide coding for a polypeptide
according to any preceding claim.

8. An expression vector comprising a polynucleotide
according to claim 6.

9. A pharmaceutical composition containing a
polypeptide according to any of claims 1 to 5 in
admixture with one or more suitable carriers or
excipients therefor.

10. A peptide having human EPO activity but which is
not natural human EPO, substantially as described
herein.
[Figure: plots with horizontal axis "amino acid position" (tick ranges 0-30, 40-65 and 100-130); images not reproduced]